Redesigning Case Retrieval to Reduce Information Acquisition Costs

نویسندگان

  • Vijay S. Mookerjee
  • Michael V. Mannino
چکیده

Retrieval of a set of cases similar to a new case is a problem common to a number of machine learning approaches such as nearest neighbor algorithms, conceptual clustering, and case based reasoning. A limitation of most case retrieval algorithms is their lack of attention to information acquisition costs. When information acquisition costs are considered, cost reduction is hampered by the practice of separating concept formation and retrieval strategy formation. To demonstrate the above claim, we examine two approaches. The first approach separates concept formation and retrieval strategy formation. To form a retrieval strategy in this approach, we develop the CRlc (case retrieval loss criterion) algorithm that selects attributes in ascending order of expected loss. The second approach jointly optimizes concept formation and retrieval strategy formation using a cost based variant of the ID3 algorithm (ID3c). ID3c builds a decision tree wherein attributes are selected using entropy reduction per unit information acquisition cost. Experiments with four data sets are described in which algorithm, attribute cost coefficient of variation, and matching threshold are factors. The experimental results demonstrate that (i) jointly optimizing concept formation and retrieval strategy formation has substantial benefits, and (ii) using cost considerations can significantly reduce information acquisition costs, even if concept formation and retrieval strategy formation are separated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Medical Informatics: Concepts and Applications

Medical Informatics is a developing body of knowledge concerned with the use of information and communication technology in support of medical research, education and also for promoting health care delivery. The field focuses on the biomedical information, patient data, and also acquisition, storage, retrieval and optimal use of information for problem solving and decision making. The goal of m...

متن کامل

A Domain Speci c Lexicon Acquisition Tool for Cross Language Information Retrieval

With the recent enormous increase of information dissemination via the web as in centive there is a growing interest in supporting tools for cross language retrieval In this paper we describe a disclosure and retrieval approach that ful lls the needs of both information providers and users by o ering fast and cheap access to a large amounts of documents from various language domains Relevant in...

متن کامل

A domain Specific Lexicon Acquisition Tool for Cross-Language Information Retrieval

With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a disclosure and retrieval approach that fulllls the needs of both information providers and users by ooering fast and cheap access to a large amounts of documents from various language domains. Relevant i...

متن کامل

Modeling storage and retrieval of memories in the brain

We have proposed a neural network model that stores the incoming information after orthogonalizing it in the same manner as vectors are orthogonalized. The scheme enables the brain to compare a new informational system with those in the memory and store its similarities and differences with the old memories in an economical manner. This allows the brain to have an enormous capacity and yet the ...

متن کامل

Optimization of Waste Collection System Using Underground Containers with Source Separation Plan (Case Study: District 3 of Yazd Municipality, Iran)

Introduction: Optimization of waste collection systems can reduce waste management costs. In this study, optimization of the waste collection system of district 3 of Yazd municipality of Iran has been investigated using underground containers. Materials and Methods: In this research, after collecting information and performing field inspections, the statistical and raster information obtained ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Information Systems Research

دوره 8  شماره 

صفحات  -

تاریخ انتشار 1997